89 research outputs found

    The Unfulfilled Potential of Data-Driven Decision Making in Agile Software Development

    Get PDF
    With the general trend towards data-driven decision making (DDDM), organizations are looking for ways to use DDDM to improve their decisions. However, few studies have looked into the practitioners view of DDDM, in particular for agile organizations. In this paper we investigated the experiences of using DDDM, and how data can improve decision making. An emailed questionnaire was sent out to 124 industry practitioners in agile software developing companies, of which 84 answered. The results show that few practitioners indicated a widespread use of DDDM in their current decision making practices. The practitioners were more positive to its future use for higher-level and more general decision making, fairly positive to its use for requirements elicitation and prioritization decisions, while being less positive to its future use at the team level. The practitioners do see a lot of potential for DDDM in an agile context; however, currently unfulfilled

    The unfulfilled potential of data-driven decision making in agile software development

    Get PDF
    With the general trend towards data-driven decision making (DDDM), organizations are looking for ways to use DDDM to improve their decisions. However, few studies have looked into the practitioners view of DDDM, in particular for agile organizations. In this paper we investigated the experiences of using DDDM, and how data can improve decision making. An emailed questionnaire was sent out to 124 industry practitioners in agile software developing companies, of which 84 answered. The results show that few practitioners indicated a wide-spread use of DDDM in their current decision making practices. The practitioners were more positive to its future use for higher-level and more general decision making, fairly positive to its use for requirements elicitation and prioritization decisions, while being less positive to its future use at the team level. The practitioners do see a lot of potential for DDDM in an agile context; however, currently unfulfilled

    Robust Adaptive Control of an Uninhabited Surface Vehicle

    Get PDF
    In this paper, we develop a novel and robust adaptive autopilot for uninhabited surface vehicles (USV). In practice, usually asudden change in dynamics results in aborted missions and the USV has to be rescued to avoid possible damage to other marine crafts inthe vicinity. This problem has been investigated in our innovative design, which enables the autopilot to cope well with significant changes in the system dynamics and empowers USVs to accomplish their desired missions. The model predictivecontrol technique is employed which adopts an online adaptive nature by utilising three algorithms. Even with random initialisation,significant improvements over the gradient descent and least squares approaches have been achieved by the modified weightedleast squares (WLS) method, which periodically reinitialising the covariance matrix. Extensive simulation studies have been performed to test and verify the advantages of the proposed method

    A New Euler's Formula for DNA Polyhedra

    Get PDF
    DNA polyhedra are cage-like architectures based on interlocked and interlinked DNA strands. We propose a formula which unites the basic features of these entangled structures. It is based on the transformation of the DNA polyhedral links into Seifert surfaces, which removes all knots. The numbers of components , of crossings , and of Seifert circles are related by a simple and elegant formula: . This formula connects the topological aspects of the DNA cage to the Euler characteristic of the underlying polyhedron. It implies that Seifert circles can be used as effective topological indices to describe polyhedral links. Our study demonstrates that, the new Euler's formula provides a theoretical framework for the stereo-chemistry of DNA polyhedra, which can characterize enzymatic transformations of DNA and be used to characterize and design novel cages with higher genus

    An analysis of local and global solutions to address Big Data imbalanced classification: a case study with SMOTE preprocessing

    Get PDF
    Addressing the huge amount of data continuously generated is an important challenge in the Machine Learning field. The need to adapt the traditional techniques or create new ones is evident. To do so, distributed technologies have to be used to deal with the significant scalability constraints due to the Big Data context. In many Big Data applications for classification, there are some classes that are highly underrepresented, leading to what is known as the imbalanced classification problem. In this scenario, learning algorithms are often biased towards the majority classes, treating minority ones as outliers or noise. Consequently, preprocessing techniques to balance the class distribution were developed. This can be achieved by suppressing majority instances (undersampling) or by creating minority examples (oversampling). Regarding the oversampling methods, one of the most widespread is the SMOTE algorithm, which creates artificial examples according to the neighborhood of each minority class instance. In this work, our objective is to analyze the SMOTE behavior in Big Data as a function of some key aspects such as the oversampling degree, the neighborhood value and, specially, the type of distributed design (local vs. global).Instituto de Investigación en Informátic

    Big Data Fusion Model for Heterogeneous Financial Market Data (FinDF)

    Get PDF
    The dawn of big data has seen the volume, variety, and velocity of data sources increase dramatically. Enormous amounts of structured, semi-structured and unstructured heterogeneous data can be garnered at a rapid rate, making analysis of such big data a herculean task. This has never been truer for data relating to financial stock markets, the biggest challenge being the 7 Vs of big data which relate to the collection, pre-processing, storage and real-time processing of such huge quantities of disparate data sources. Data fusion techniques have been adopted in a wide number of fields to cope with such vast amounts of heterogeneous data from multiple sources and fuse them together in order to produce a more comprehensive view of the data and its underlying relationships. Research into the fusing of heterogeneous financial data is scant within the literature, with existing work only taking into consideration the fusing of text-based financial documents. The lack of integration between financial stock market data, social media comments, financial discussion board posts and broker agencies means that the benefits of data fusion are not being realised to their full potential. This paper proposes a novel data fusion model, inspired by the data fusion model introduced by the Joint Directors of Laboratories, for the fusing of disparate data sources relating to financial stocks. Data with a diverse set of features from different data sources will supplement each other in order to obtain a Smart Data Layer, which will assist in scenarios such as irregularity detection and prediction of stock prices

    Custom Hardware Versus Cloud Computing in Big Data

    Get PDF
    The computational and data handling challenges in big data are immense yet a market is steadily growing traditionally supported by technologies such as Hadoop for management and processing of huge and unstructured datasets. With this ever increasing deluge of data we now need the algorithms, tools and computing infrastructure to handle the extremely computationally intense data analytics, looking for patterns and information pertinent to creating a market edge for a range of applications. Cloud computing has provided opportunities for scalable high-performance solutions without the initial outlay of developing and creating the core infrastructure. One vendor in particular, Amazon Web Services, has been leading this field. However, other solutions exist to take on the computational load of big data analytics. This chapter provides an overview of the extent of applications in which big data analytics is used. Then an overview is given of some of the high-performance computing options that are available, ranging from multiple Central Processing Unit (CPU) setups, Graphical Processing Units (GPUs), Field Programmable Gate Arrays (FPGAs) and cloud solutions. The chapter concludes by looking at some of the state of the art solutions for deep learning platforms in which custom hardware such as FPGAs and Application Specific Integrated Circuits (ASICs) are used within a cloud platform for key computational bottlenecks

    Adaptive Semi-Supervised Classifier Ensemble for High Dimensional Data Classification

    No full text

    Optimization of the novelty detection model based on LSTM autoencoder for ICS environment

    No full text
    The recent evolution in cybersecurity shows how vulnerable our technology is. In addition, contemporary society becoming more reliant on “vulnerable technology”. This is especially relevant in case of critical information infrastructure, which is vital to retain the functionality of modern society. Furthermore, the cyber-physical systems as Industrial control systems are an essential part of critical information infrastructure; and therefore, need to be protected. This article presents a comprehensive optimization methodology in the field of industrial network anomaly detection. We introduce a recurrent neural network preparation for a one-class classification task. In order to optimize the recurrent neural network, we adopted a genetic algorithm. The main goal is to create a robust predictive model in an unsupervised manner. Therefore, we use hyperparameter optimization according to the validation loss function, which defines how well the machine learning algorithm models the given data. To achieve this goal, we adopted multiple techniques as data preprocessing, feature reduction, genetic algorithm, etc. © Springer Nature Switzerland AG 2019.Internal Grant Agency [IGA/FAI/2019/002]; Ministry of the Interior of the Czech Republic [VI20172019054
    • …
    corecore